A System for Summarizing Scientific Topics Starting from Keywords

نویسندگان

  • Rahul Jha
  • Amjad Abu-Jbara
  • Dragomir R. Radev
چکیده

In this paper, we investigate the problem of automatic generation of scientific surveys starting from keywords provided by a user. We present a system that can take a topic query as input and generate a survey of the topic by first selecting a set of relevant documents, and then selecting sentences from those documents. We discuss the issues of robust evaluation of such systems and describe an evaluation corpus we generated by manually extracting factoids, or information units, from 47 gold standard documents (surveys and tutorials) on 7 topics in Natural Language Processing. We have manually annotated 2,625 sentences with these factoids (around 375 sentences per topic) to build an evaluation corpus, which we use for comparative evaluation of our system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Precision of Keywords Extracted From Persian Text Using Word2Vec Algorithm

Keywords can present the main concepts of the text without human intervention according to the model. Keywords are important vocabulary words that describe the text and play a very important role in accurate and fast understanding of the content. The purpose of extracting keywords is to identify the subject of the text and the main content of the text in the shortest time. Keyword extraction pl...

متن کامل

ارائه روشی جدید برای شاخص‌گذاری خودکار و استخراج کلمات کلیدی برای بازیابی اطلاعات و خوشه‌بندی متون

Persian words in writing with a diverse and cover all modes of grammatical words with the recruitment of a series of specific rules because it is impossible to extract keywords automatically from Persian texts difficult and complex. This thesis has attempted to use linguistic information and thesaurus, keywords Mnatry be provided. Using the symbol system is structured network can be keywords, i...

متن کامل

Regular Sleep Schedule: A Way to Prevent Emotional/Behavioral Problems in Children

Abstract Background: Prevention of children's behavioral problems is more important and necessary for parents, teachers, educators, and psychologists. In this article, behavioral-emotional disorders have been defined. Then, to prevent behavioral problems in children, Regular sleep planning is suggested, Finally, the conclusion was made. Method: Data analysis in the present study was done i...

متن کامل

The online attention to certain nuclear medicine topics: An altmetrics study vs. a citation analysis

Introduction: Traditional citation analysis has been greatly criticized because the process of citation accumulation requires considerable time after publication. So, the term “altmetrics” was proposed in 2010 to measure the scientific and social impact of a paper.We performed a search for certain nuclear medicine topics using the altmetrics approach to report the correlation b...

متن کامل

NLP Driven Models for Automatically Generating Survey Articles for Scientific Topics

This thesis presents new methods that use natural language processing (NLP) driven models for summarizing research in scientific fields. Given a topic query in the form of a text string, we present methods for finding research articles relevant to the topic as well as summarization algorithms that use lexical and discourse information present in the text of these articles to generate coherent a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013